Mechanism for the failure of the Edwards hypothesis in the SK spin glass 
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The dynamics of the SK model at T = starting from random spin configurations is considered. 
The metastable states reached by such dynamics are atypical of such states as a whole, in that the 
probability density of site energies, p(A), is small at A = 0. Since virtually all metastable states have 
a much larger p(0), this behavior demonstrates a qualitative failure of the Edwards hypothesis. We 
look for its origins by modelling the changes in the site energies during the dynamics as a Markov 
process. We show how the small p(0) arises from features of the Markov process that have a clear 
physical basis in the spin-glass, and hence explain the failure of the Edwards hypothesis. 



Complex systems like granular media have a large 
number of metastable (blocked) configurations. When 
shaken or tapped, they quickly relax into another 
metastable state. A subsequent tap will result in an- 
other blocked or jammed state, and so on. The complex- 
ity (entropy) of metastable states in granular systems 
or spin glasses is extensive in the system size. Edwards 
and co-workers have proposed that the quasi-equilibrium 
steady state which results from repeated tapping can 
be described using a thermodynamic measure over the 
metastable states 0, Q . The strongest version of such a 
hypothesis predicts that a system adopts configurations 
which maximize the entropy. In weaker versions parame- 
ters such as the energy or volume are fixed, and the sys- 
tem adopts configurations which maximize the entropy 
consistent with the constraints. 

Edwards hypotheses have met with a high degree of 
success in many complex systems. Some recent exam- 
ples include predicting (i) the distribution of contact 
forces 0, and the effective temperature 0, in simu- 
lations of granular media, (ii) the dynamical entropy 
and correlation functions in the slow-dynamics regime 
of the Kob- Anderson model [f|, and (iii) the distribu- 
tion of steady-state energies in the tapped Sherrington- 
Kirkpatrick model . They seem to be a good approx- 
imation, although not exact, for the zero-temperature 
constrained dynamics of finite-dimensional Ising ferro- 
magnets 0. We note also support in the context of 
the slow dynamics of mean-field spin-glass models, where 
it has been argued that the effective temperature coin- 
cides with the Edwards temperature [a, ID] • The under- 
lying general idea that dynamics does not strongly select 
amongst metastable states is yet more widely used to 
attribute slow dynamics to a proliferation of metastable 
states - in optimization algorithms ||, for example. 

Here we study dynamics in the the canonical SK 
model, for which the metastable states are already well- 
understood 0, ^| . We show that the metastable states 
selected by dynamics are of a very special character in 
which the energy 2Xi to flip the spin at site i has a 
distribution p(X) which is small for A « 0. Generic 



metastable states have p(Q) ^ 0. The dynamically 
selected metastable states are a vanishing fraction of 
the totality of metastable states in the thermodynamic 
limit and therefore, according to the Edwards hypothe- 
sis, should not be expected to be selected. We provide a 
model of the dynamics which explains why it converges 
onto this tiny subset of the metastable states. 

The SK Hamiltonian is H = — ^ii^H^j = 
-|Ei^> where Si = ±1, A l = S l Y J j^ t 'h 3 S j is the 
"site-energy" , equal to one-half of the energy change on 
flipping the spin Si, and indicates a sum over all 

pairs of sites. The interaction strengths Jij are indepen- 
dent random variables from a Gaussian distribution with 
zero mean and standard deviation 1 / y/W. 

We consider the non-equilibrium behavior of the model 
under single-spin relaxational dynamics 0, H2I Il3| . 
starting from a random initial state. We consider the 
T = limit of this dynamics, as in Refs. 0, and Q, 
because it allows the metastable states to be clearly iden- 
tified. Further motivation for studying this limit comes 
from its use in contexts ranging from hysteresis in the 
SK model to domain growth in ferromagnets; it cor- 
responds to the basic Hopfield neural-network algorithm, 
and to the greedy steps in the walk-SAT algorithm 0. 

The state evolves by flipping single spins with Aj < 0, 
i.e., those which are opposed to the local magnetic field 
on their site, until no such spins remain. Different choices 
for the order of spin flips lead to different versions of 
the algorithm. In the "sequential" algorithm a randomly 
selected unstable spin is flipped at each timestep, while in 
the "greedy" algorithm the most unstable (minimum A^) 
spin is flipped. The behavior of these different algorithms 
is remarkably similar [T3|. 

The T = dynamics of the SK model converges 
onto one-spin-flip-stable states, in which every spin aligns 
with its local field. This model is an attractive one in 
which to consider the Edwards hypothesis, because these 
metastable states have been studied analytically p3, Hcj. 
The key results are shown in Fig. in which the cal- 
culated and measured entropy ("complexity") and dis- 
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FIG. 1: Discrepancies between simulations of sequential spin- 
glass dynamics on a system of 5000 spins and the analytical 
predictions based on flat-measure assumptions. Left panel, 
curve (left axis): Complexity of metastable states of energy 
E, E(E) = jf log N s (E) for the SK model, where JV S (E) is the 
mean number of metastable states with energy E. Bar (right 
axis): Histogram of the converged energies for 65 runs of the 
sequential spin-glass dynamics. Right panel, curve: Average 
p(A) in the metastable states of energy —0.7. Histogram: Av- 
erage p(X) over the final states of the spin-glass dynamics. 



tribution of local energies of the metastable states are 
compared. The converged energies do not cluster at the 
peak of the complexity curve, but are instead clustered 
in a narrow range around E « —0.7, so the dynamics cer- 
tainly does not sample the metastable states uniformly. 
Furthermore, the computed p(X) is qualitatively different 
from the flat-average p(X) at the converged energy: The 
computed p(X) has a negligible intercept, whereas the 
flat-average p(X) has a significant finite intercept. Thus 
the dynamics does not uniformly sample the metastable 
states at the converged energy. Furthermore, repeated 
"tapping" of a randomly selected fraction of spins does 
not alter this conclusion: In our simulations the steady- 
state does not develop an intercept. Thus the states 
reached by the dynamics are always qualitatively differ- 
ent from the totality of metastable states of the same 
energy, and in the thermodynamic limit they are a neg- 
ligible fraction of these states. In other situations, it 
has been observed that the blocked states reached by the 
dynamics have different e nerg ies to those typical of the 
blocked states as a whole Our work shows this fea- 
ture too, but furthermore that the dynamically generated 
states are even atypical of the states of the same energy. 

To understand why the typical metastable states are 
not realized we must look to the dynamics. We simplify 
the problem by considering only the population of site- 
energies, {A}, and making the working assumption that 
the evolution of p(X) can be modeled in terms of a Markov 
process in this population. 

The population dynamics is designed to parallel the 
real spin-glass dynamics. At each step an unstable spin i 
is flipped, corresponding to A^ — * —At. In the spin-glass 



X j • i — 2 Sj, Sj 3%i • 



(1) 



Here Si and Sj denote the spin configuration before the 
flip. To obtain a population dynamics we replace the 
drifts AXj.i with functions of the site-energies. In the 
Markov approximation we replace them with indepen- 
dent random variables, whose distribution P(AXj-i\{X}) 
depends only upon the site-energies at each step. 

Similar approaches have previously been applied to the 
SK model 113,113, granular media 0, the walk-SAT al- 
gorithm || , and spin models on random graphs 0, 0] . 
Previous work on the SK model has attempted to cal- 
culate P(AXj-i\{X}). Although this approach has met 
with some success 0, it leads to very involved mod- 
els. Owing to their complexity, these models are only 
tractable numerically, and their physics remains obscure. 
We therefore take a different approach, which is to de- 
termine the general features of P(AAj ; ;|{A}) that suffice 
for a qualitative understanding of the dynamics. 

We can deduce some of the general features of 
P(AXj-i\{X}) directly from QJ. Because the model is 
completely connected, summing the drifts over all the 
unflipped spins gives the sum rule Yljjti = — 2A». 

Therefore, to model the dynamics with a Markov process, 
we must take P(AA J; i|{A}) to have a mean cx 1/N in the 
large- TV limit. Since S 2 — 1, the variance of the drifts is 
then just associated with that of the bond distribution, 



<AA^) - (AA, ;I ) 2 ~ i/N. 



(2) 



Our simulations of the spin-glass dynamics converged 
in < N flips. Since Jy = 0(1/ VN), the third and higher 
cumulants of AXj-i are higher order in 1/N than the mean 
and variance. Therefore the total drift produced by the 
higher cumulants is negligible over the convergence time, 
and we may take P(AXj-i\{X}) to be Gaussian. Any 
correlations between the drifts, AXj-i, and the fields Xj 
would have a qualitative effect on the evolution of p(A). 
We looked for such correlations by taking the states gen- 
erated during the spin-glass dynamics and numerically 
evaluating the drifts when spins are flipped. The results 
are shown in Fig. [21 Each point is the total drift of an 
unflipped spin as a function of its site-energy when all 
spins with site-energies in a small range are flipped. 

Note the general correlation between the drifts and the 
site-energies which can be seen in Fig-El The overall drift 
on flipping a spin i is fixed by the sum rule but it is non- 
uniformly distributed amongst spins according to their 
energies: Highly unstable spins tend to have their site- 
energies strongly increased, at the expense of a reduced 
increase or a decrease in the site-energies of the more 
stable spins. This is physically reasonable because a very 
unstable spin has mostly unsatisfied bonds, while a very 
stable spin has mostly satisfied bonds. Therefore the spin 
i is likely to be connected to a highly unstable spin by an 
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unsatisfied bond, and to a highly stable spin by a satisfied 
bond, producing the observed correlation. 

We now consider whether the general features we have 
identified can explain aspects of the spin-glass dynam- 
ics, in particular the observation that it apparently con- 
verges in ~ N timesteps, to a state with a small inter- 
cept and an approximately linear p(A). We adopt the 
following minimal model, which captures the behavior of 
the distribution p(A) at small A and at late times. We 
make the simplest assumption, that the drift AA J; i in 
the value of Xj resulting from flipping an unstable spin 
i is a Gaussian random variable with mean c/N (c > 0) 
and variance a 2 /N, where, according to Eq. J3J, a 2 = 4. 
This assumption is motivated by the correlations visible 
in Fig. [21 which lead us to expect that the mean drift 
of a low-energy spin is non-vanishing as the converged 
state is approached. Since the assumption of a constant 
drift violates the previously derived sum-rule, it cannot 
be correct for all sites. Our model is designed to address 
the behaviour of p(X, t) at small A. 

The equation of motion for p(X, t) is, for large N, 

_ dp(X,t) a 2 d 2 p(X,t) 
C 8X + 2 dX 2 ' (6) 

where q{t) — J° oc p(X,t)dX is the weight in the negative 
side of the distribution (from which the flipped spins are 
drawn) at time t, and the units of time are such that 
there are N moves per unit time. 

The first term in derives from the flipping process 
Xi — > — A,-, which simply transfers the population from 
negative to positive A at a rate of 1 spin per timestep. 
The second term derives from the mean of the drifts, 
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FIG. 2: Fotal changes in the site energies, £V AXj-i of un- 
flipped spins when all spins with site energies in the ranges 
— 1.0 < Xi < —0.5 are flipped, in configurations generated by 
100 (left panel, 671 flipped spins), and 500 (right panel, 420 
flipped spins) steps of the greedy algorithm on a system of 
5000 spins. This algorithm converged after 2465 flips. The 
straight lines show linear fits to the data. 



which leads, within our model, to a uniform convection 
in the A space. The final diffusion term is due to the 
fluctuations in the drifts. All these processes occur on 
the same timescale, taking ~ N steps, or a time ~ N°, 
to produce an effect of order 1 on p(X, t). 

To understand the solutions to Eq. © we first consider 
the case a 2 — 0. The equation of motion can then be 
solved analytically, to give p{X,t) in terms of integrals 
over p(A,0). The results are shown in the top panel of 
Fig. for a Gaussian initial condition and c = 4. The 
number of spins with A < is always decreasing at a 
finite rate, due to the convection across A = and the 
flipping process. Thus this process certainly converges, 
reaching q(t) = in a finite time. In general the decay 
of p(0,i) near the end of the evolution is linear in time, 
which combines with the convection to produce a linear 
p(X), with no intercept, in the converged state. The slope 
depends on the initial conditions and on c. It diverges as 
c — > 0, where the resulting p(X) is just the half-Gaussian 
created by the flipping. 

For a 2 > 0, we have solved (0) numerically. The result- 
ing p(X, t) are shown in the lower three panels of Fig. 




FIG. 3: The solution to 10, with a Gaussian initial condition 
and c = 4 (top three panels) and 8 (lowest panel), for a 2 — 
(top panel), a 2 = 1 (middle panel), and a 2 = 4 (lower two 
panels). Curves are plotted at time intervals of 0.05. The 
bold curves in the lower two figures are the earliest at which 
p(0,t) « 0.03. They agree with the histograms obtained by 
direct simulation of the population dynamics model with 5000 
fields, shown for c = 8(crosses). 
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For these values of c and a 2 the behavior at early times 
is similar to that with a 2 — 0. The diffusion, however, 
smooths out the singularities (discontinuity of slope at 
A = 0) evident in the a 2 = solutions, and broadens the 
distribution, but the tail of unstable spins continues to 
decay at a significant rate. This can be understood by 
noting that while the positive slope at A = leads to a 
diffusion current back towards A < 0, for these parame- 
ters this current is too small to overcome the loss due to 
flipping and convection. In contrast, if c is too small the 
solution with a = would have a large average slope at 
A = 0, and the diffusion would have a major effect. 

Although for some c and a 2 the early-time behav- 
ior is similar to that of the model with a 2 = 0, we 
see that a new regime appears at later times. As the 
tail of unstable spins becomes narrower, the slope at 
A = increases, while the intercept continues to de- 
cay. This slows the decay of q(t), which obeys dq/dt = 
-1 - cp(0,t) + ia 2 j2) {dp/d\)\x= , with terms due to 
flipping, convection, and diffusion respectively. Indeed, 
in the the lower two panels the slope at A = is approach- 
ing the critical slope of 2 /a 2 at which the diffusion cur- 
rent balances the loss due to flipping. q(t) must continue 
to decay, since the bulk of p(X) will continue to diffuse 
and convect, and by continuity this must reduce the tail 
of unstable spins. However this decay is extremely slow. 
Furthermore, it is an artifact of our low-energy approxi- 
mation, in which we replaced the A-dependent convection 
rate by a constant. In a more complete treatment the bi- 
asing visible in Fig. [3 would tend to confine the bulk of 
the distribution to a region centered onA~ I, due to neg- 
ative convection rates at large A, whereas in the model 
the maximum of the distribution continues to drift to the 
right - see Fig. 

In an infinite system the Markov process and the spin- 
glass dynamics terminate when q(t) — 0. Our numerics 
suggest that this does not occur in a finite time for the 
Markov process, unless a = 0. Hence it is inconsistent 
with the conjecture that the dynamics of the infinite spin- 
glass converges in a finite time. However, for moderate 
values of c the features in p(X) associated with the slow- 
ing of the dynamics become so small that it would require 
a very large system for them to be resolved. Therefore we 
suggest that the minimal Markov model may be adequate 
to understand the convergence seen in the spin-glass sim- 
ulations, which are finite, albeit large. 

In the finite spin-glass the converged p(X) has a small 
intercept, which we can estimate by fitting to histograms 
such as those shown in Fig. ^ For N = 1000 we ob- 
tain an intercept of 0.06, and « 0.03 for N = 5000 and 
N = 10000, consistent with the intercept of 2/^/N sug- 
gested in Ref.^3- This scaling is explained by the Markov 
model, since for p(0) < 1/VN the average diffusion flux 
from positive to negative A is less than the one spin per 
timestep transferred in the opposite direction by the flip- 
ping. The dynamics will rapidly converge after such an 



intercept is reached, with little further change in p(X). 

Based on these arguments and the results for the direct 
simulations of the spin glass, we suggest that the Markov 
process will converge in a finite system when p(0, i) ob- 
tained from Eq. Q becomes comparable with I / y/N. For 
a large enough c, this condition is met before the dynam- 
ics becomes dominated by diffusion, and the resulting 
p(\, t) has some features similar to that of the simula- 
tional result. This can be seen in the lower two panels 
of Fig. |21 where we mark in bold the p(X, t) at which 
p(0, t) ~ 0.03. This corresponds to the smallest intercept 
we have seen in the spin-glass simulations. Direct simu- 
lations of the minimal model in a finite population lead 
to similar distributions. 

To conclude, we have discovered a correlation between 
the energy shifts and site-energies in the spin-glass dy- 
namics, and shown that such a correlation can be suffi- 
cient for the dynamics to converge to a metastable state 
in a large but finite system. Since in the population- 
dynamics approach the converged state will have a nearly 
continous p(X) , while the typical metastable states have 
a discontinous one, the success of a population-dynamics 
approach implies the failure of the flat-measure one. Such 
success is only possible because the population-dynamics 
converges: otherwise spins would flip many times, and 
the Markov approximation would fail. 

These considerations suggest an unusual picture of the 
origins of slow dynamics in some complex systems. Dis- 
order and frustration do play a role, captured by the dif- 
fusion term, in preventing a fast convergence of the dy- 
namics, but this role is limited by the drift. This causes 
the dynamics to converge long before it has time to thor- 
oughly explore the state space, and so the Edwards hy- 
pothesis fails. 

The Edwards hypothesis was shown to correctly pre- 
dict the form of the distribution of steady-state energies 
in simulations of tapping the SK model in Ref. |6J. Given 
our results, this agreement now poses an intriguing prob- 
lem. Perhaps the true dynamical entropy ~Edyn(E) has 
a similar energy dependence to that of the flat-measure 
entropy Y, e d w (E) , so that the energy distributions in tap- 
ping take similar forms. Since the states are very differ- 
ent, however, it is unclear why this should occur. 
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